-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Metricbeat ETCD overview dashboard #10591
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Tested it locally and seems to load as expected. My view is even more borking as I just have 1 node in the dropdown. ++ on the dropdown for the node selection.
4c0eaa5
to
56ecf99
Compare
jenkins, test this |
I'm really struggling trying to understand what's wrong with https://travis-ci.org/elastic/beats/jobs/489300684#L6610 @ruflin @jsoriano if any of you have some time can you help me with this, please? |
Can you reproduce this failure running |
Thanks @jsoriano but there's not |
Oh yes, sorry, I meant this one 🙂 let me try... |
Umm this actually works for me, have you tried to relaunch builds? |
jenkins, test this |
jenkins, test this before I get totally nuts 😄
3 builds, 2 dashboard error, 1 The job exceeded the maximum time limit for jobs, and has been terminated. on metricbeat... let's see the 4th... |
jenkins, test this please. Now we are in 4 builds, 2 dashboard error, 2 The job exceeded the maximum time limit for jobs, and has been terminated. on metricbeat... let's see the 5th... |
The CI failure is related to this change. The test_dashboards build has the following error:
What version did you use the export the dashboard? |
@ruflin I'm not doing anything very special with this dashboard. It has been done like the previous dashboards I did in recent weeks. 7.0.0 in Kibana and ES. Where did you see that error? I can only find this on Travis and I don't get any error on local.
|
I copied the above error from |
9fe0987
to
d114b03
Compare
jenkins, test this please |
d114b03
to
0996789
Compare
jenkins, test this please There's something wrong in my setup probably but here's the output of the same command ran in my local setup:
I really don't see anything very relevant apart from the fact that the dashboard was created with |
Ok. I found the reason why it is not failing on my local. It seems that if I have an environment launched with So if I ensure that the environment launched with
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this PR still ready for review or should we set it to in progress
?
61b7e96
to
a862997
Compare
a1aedc3
to
97a5963
Compare
Can we rename this dashboard as ETCD V2? A V3 dashboard should remove most of these metrics and include the metricbeat etcd metricset |
I have concerns about naming a dashboard with a version. The main reason is that we usually just "publish" what is already developed, so we maintain a snapshot of Beats. Once we have a new ETCD module in the future that supports v3, we must decide if we will maintain support for version 2 or not and how, which can be tricky and / or affect the name of some metrics which will affect the dashboard too. Another less important reason is that, right now, we don't have any version-specific dashboard so this could lead into confusion to some users, specially when the module itself is not version named either. I propose to maintain the name like this and then, once we have a v3 specific module, decide about what to do with it (delete, rename, modify, whatever) so the state of modules <-> dashboards is consistent. WDYT @odacremolbap ? 😉 |
#11280 (comment) My main issue is that everyone should be using Etcd V3 (same binary as V2) and almost noone should be using V2. When I tested the dashboard it was empty, it took me a minute to remember that this was targeting V2 only. I think we need to highlight this is V2 only. If we need to come up with only 1 dashboard, it would be a V3 one. |
Oh, but I didn't know that we already were fetching v3 metrics, so this changes everything. So now I'm more towards closing this PR without merging if v2 is going to be deprecated in the short term anyways. WDYT @ruflin ? |
If most users are on v3 already, I would say we only need a v3 dashboard. But if there is still a big portion (let's say more then 20%) on v2 for some reason, we could have 2 dashboards (we do that for other modules too). We could then have a drop down which allows to jump between the dashboards or a link list. |
We already ship the V2 API metrics and the Dashboard looks good. My take is going forward with this one, just because we worked on it and someone will probably benefit from it. If Etcd decided to include V2 API into the V3 binary, I assume there will be people using it, mainly apps that integrate with etcd and didn't modify their client libs when V3, which was fully incompatible, came out (just as a reminder, data created using one version of etcd can't be seen using the other version) It is hard to know who is still using V2 API. I would say less than that 20% because I'm guessing most of etcd users are kubernetes clusters --> V3 |
@sayden not sure if this might be related to my environment, will double check with a new setup in a while, but I'm having some flickering and lack of info with the dashboard At my local host I'm running 3 etcd instances and running a single metribeat where etcd is configured as
clients processes are reading and writing to each instance. I've had an scenario where shown data would flicker, usually showing no data, then at some refreshes showing some. Tried that from different browsers and computers, same results My first configured etcd instance seemed to have an influence on the graphs, while the other 2 didn't. Even nuking members 2 and 3, and leaving the cluster with no quota, there was no visualization changes for such critical scenario. As said above, I'll setup a more real world scenario and re-test. |
Closing this, if not many users are using v2 anymore and you found some issues, I think it's better to work on a v3 dashboard directly. Thanks for the comments! |
An overview dashboard for ETCD Metricbeat module. I have initially placed it in
7
folder for Kibana.I configured a cluster and send some random operations in a for loop to generate some data but the screenshot is still a bit empty of dynamism 😄
I also added an entry in the docs and a screenshot file.